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ABSTRACT 

We use a series of statistical techniques to compare the clustering of samples of IRAS 
galaxies selected on the basis of their far-infrared emission temperature, to see whether 
a temperature-dependent effect, such as might be produced by interaction-induced star 
formation, could be responsible for the increase in clustering strength with redshift in 
the QDOT redshift survey that has been reported by several authors. 

The temperature-luminosity relation for IRAS galaxies means that warm and cool 
samples drawn from a flux-limited sample like QDOT will sample quite different volumes 
of space. To overcome this problem, and to distinguish truly temperature-dependent 
t_. results from those depending directly on the volume of space sampled, we consider a 

pair of samples of warmer and cooler galaxies with matched redshift distributions, as 
QQ well as pairs of samples selected using a simple temperature cut. 

We find that the redshift space autocorrelation function of warm QDOT galaxies 
is significantly stronger than that of cool galaxies on large scales, but that this difference 
disappears when we come to consider the warmer and cooler samples with matched 
redshift distributions. A counts-in-cells analysis reveals no significant difference between 
the clustering of the warm and cool QDOT samples, while the use of a new, symmetric 
^-H estimator reveals that the cross-correlations of warm and cool IRAS galaxies with Abell 

l/^ clusters do not differ significantly. A higher signal-to-noise test is provided by computing 

On the projected cross-correlations of the matched samples with the parent two-dimensional 

*^ catalogue from which QDOT is drawn and this does yield a marginal detection of greater 

O ^ large scale power for warmer galaxies. A direct comparison of the distributions of the 

I warmer and cooler samples, using a new technique which tests the null hypothesis that 

O they are drawn from the same population, reveals that the two classes of galaxy do 

cluster differently on small scales in redshift-space, while their ^(cr, tt) plots suggest 
that the apparent concentration of more warm IRAS galaxies into richer environments 
refiects the fact they sample richer volumes of space within the QDOT survey, rather 
than illustrating a correlation between temperature and richness. 

We conclude that there may be a temperature-dependent component to the 
observed increase in the clustering strength of QDOT galaxies with redshift, but that 
it is less important than a sampling effect, which refiects the local cosmography, rather 
than the physical properties of the galaxies and their environment. We discuss the 
implications of this work for the use of IRAS galaxies as probes of large-scale structure 
and for models accounting for their far-infrared emission by interaction-induced star 
formation. 

Key words: Galaxies: clustering; cosmology: large-scale structure of Universe. 



1 INTRODUCTION 



IRAS offers the cosmologist excellent sky coverage, together 
with uniform flux calibration, good positional accuracy and 
insignificant Galactic absorption. These factors combine to 
make the distribution of IRAS galaxies a potentially impor- 



tant cosmological probe and one that has been employed by 
many authors in recent years. 

Implicit in the use of IRAS galaxies as large-scale struc- 
ture probes is an assumption about the homogeneity of their 
properties, at least in so far as these properties may be corre- 
lated with environment and, hence, with clustering strength. 
The overwhelming majority of galaxies observed by IRAS 
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are spirals (e.g. de Jong et al. 1984), which are seen in 
preference to early-type galaxies because they contain more 
dust to re-radiate energy from galactic sources into the far- 
infrared. There are several mechanisms through which this 
can take place. Rowan- Robinson fc Crawford (1989, here- 
after RRC89) have modelled the IRAS spectra of galaxies 
as being composed of three components: a cool 'disc' com- 
ponent, peaking in the 100/tm IRAS passband, due to the 
re-emission by interstellar dust grains of energy absorbed 
from the ambient galactic radiation field; a warm 'starburst' 
component which is produced by optically-thick dust clouds 
surrounding regions where massive stars are currently be- 
ing formed and which peaks in the 60/tm band; and a hot 
'Seyfert' component, peaking in the 12 and 25/tm bands 
and identified with emission from dust clouds surrounding 
compact power-law continuum sources. They argue that the 
IRAS spectra of their sample of 227 galaxies are well ex- 
plained in terms of varying fractions of these three com- 
ponents, with the great majority of IRAS galaxies being 
dominated by the 'disc' and 'starburst' components. Sim- 
ilar models of IRAS emission have been considered by de 
Jong et al. (1984), Helou (1986), Telesco, Wolstencroft & 
Done (1988) and Bothun, Lonsdale & Rice (1989) amongst 
others. 

The model presented by RRC89 is a simple one, but its 
basic notion of the far-infrared emission from IRAS galax- 
ies being a combination of ambient cirrus emission and lo- 
calised emission from active star-forming regions is prob- 
ably correct. The relative contributions of these two com- 
ponents to the emission from a particular galaxy may be 
quantified by a dust emission temperature deduced from 
its 60 and 100/tm fluxes. Saunders et al. (1990, hereafter 
S90) have determined the 60/tm luminosity functions for 
warm and cool subsamples of the QDOT redshift survey of 
IRAS galaxies (Lawrence et al., in preparation): they define 
their subsamples on the basis of the RRC89 models, which 
predict that the emission from galaxies with dust temper- 
atures above 36 K (classified as 'warm') is dominated by 
the 'starburst' component, while the 'disc' component dom- 
inates in ('cool') galaxies with emission temperatures be- 
low 36 K. They find that the overall luminosity function of 
IRAS galaxies is dominated by cool galaxies for luminosi- 
ties below ~5xlO^/t~^iQ (the solar luminosity, Lq, is 
Lq = 3.826 X 10'^'^ erg s~'^ and the Hubble constant. Ho, is 
written as Ho = 100 A km s~^ Mpc~^) and by warm galaxies 
above that level. 

An obvious consequence of this observed temperature- 
luminosity relation for IRAS galaxies is that there will be 
a radial gradient in the local mean galactic dust emission 
temperature in a sample of IRAS galaxies selected above a 
60/tm flux limit: as we show in Fig.l, the galaxies selected 
above the QDOT flux limit of 0.6 Jy become, on average, 
warmer with increasing redshift. 

This radial temperature gradient in QDOT appears to 
be mirrored by a similar gradient in clustering strength. 
Feldman, Kaiser & Peacock (1994) find that the ampli- 
tude of the redshift-space power spectrum of the QDOT 
survey increases by up to a factor of 1.5 if they give more 
weight to more distant galaxies, and show that this is not 
due to a luminosity-dependence to the clustering strength: 
this is confirmed by the redshift-space correlation func- 
tion analysis of Moore et al. (1994). Could this clustering 
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Figure 1. The radial temperature gradient in QDOT. 



strength gradient be connected with the temperature gradi- 
ent, with the population of galaxies in QDOT changing with 
increasing redshift, becoming dominated by warmer, more 
strongly clustered galaxies? In particular, could this chang- 
ing population become more concentrated in richer environ- 
ments, as Mo, Peacock & Xia (1993) have shown that the 
cross-correlations between QDOT galaxies and Abell clus- 
ters strengthen with redshift? 

Simple notions of interaction-induced star formation 
may provide a link between the temperature of an IRAS 
galaxy and the richness of its environment and, hence, be- 
tween the temperature and clustering strength gradients. 
There is a growing body of evidence linking the dynam- 
ical interactions between galaxies with the starburst phe- 
nomenon of enhanced star formation. Theoretical simula- 
tions of such events (e.g. Barnes & Hernquist 1991 and ref- 
erences therein) have shown how interactions can result in 
the gas in the interstellar media of the interacting galaxies 
losing angular momentum and the resulting infall of large 
quantities of gas could provide the conditions required to 
produce a burst of star formation. The relationship between 
interactions and mass infall is, however, a complicated one, 
depending on such factors as the closeness of approach and 
the orientation of the galaxies' axes of rotation relative to 
the direction of their orbital motion (e.g. Mihos et al. 1992). 
Observational evidence is equally forthcoming (see, for ex- 
ample, Sulentic, Keel & Telesco 1990 and references therein). 
Bushouse, Lamb & Werner (1988) and Xu & Sulentic (1991) 
have shown that samples of IRAS galaxies selected on the 
basis of morphological indications of interactions in the opti- 
cal have higher far-infrared luminosities than samples of iso- 
lated galaxies showing no such features. Surace et al. (1993) 
find that while infrared properties alone are insufficient to 
distinguish clearly an individual interacting galaxy from an 
individual isolated galaxy with the same blue luminosity, 
due to the intrinsic dispersion in galactic properties, it is 
seen that samples of interacting galaxies have substantially 
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warmer far-infrared colours than samples of isolated galaxies 
with the same distribution of blue luminosities. 

One might naively expect that if it is the two-body pro- 
cess of galaxy interaction that is responsible for the star- 
burst phenomenon of enhanced star formation, then star- 
burst galaxies should be found preferentially in regions of 
high galactic density and, thus, one might expect there to be 
a relationship between the dust emission temperature of an 
IRAS galaxy and the richness of its environment, producing, 
in turn, a trend of increasing clustering strength with red- 
shift, mirroring the temperature gradient. The high velocity 
dispersions in the cores of rich clusters will, however, prevent 
mergers taking place there and, so, limit the enhancement 
of clustering strength brought about by this effect, which we 
advance only as a plausible physical way of producing cor- 
related temperature and clustering strength gradients. The 
widespread use of flux-limited samples of IRAS galaxies as 
cosmological probes does, however, strongly motivate a thor- 
ough investigation of the possible effect on large-scale struc- 
ture studies of the differing 60/tm luminosity functions of 
samples of warm and cool IRAS galaxies, and that is our 
principal concern here. 

We investigate the possible existence of such an ef- 
fect by comparing the clustering strengths of subsamples 
of warm and cool IRAS galaxies in this paper: the disper- 
sion in the temperature-60/tm luminosity relation for IRAS 
galaxies means that this is not redundant in the light of pre- 
vious searches for luminosity dependence in the clustering 
of QDOT galaxies. 

The plan of the remainder of this paper is as follows. In 
Section 2 we describe the construction of warm and cool sub- 
samples from QDOT and the determination of their redshift- 
space selection functions. The redshift-space autocorrela- 
tions of these subsamples are the subject of Section 3, while 
a complementary counts-in-cells analysis is given in Section 
4, and, in Section 5, we probe the real-space clustering of 
the warm and cool samples through their projected cross- 
correlations with the galaxies of the QMW IRAS Galaxy 
Catalogue (QIQC: Rowan- Robinson et al. f99f). Section 6 
makes a direct comparison of the redshift-space distribu- 
tions of warm and cool IRAS galaxies, through the use of a 
new technique to test whether the warmer and cooler sam- 
ples are drawn from the same population. We investigate the 
possibility that the warm galaxies are preferentially located 
in richer environments through studying the anisotropy in 
{((7, 7r) in Section 7, and the cross-correlations of warm and 
cool samples with Abell clusters in Section 8: the latter is 
conducted using a new cross-correlation function estimator, 
whose use we justify in the Appendix. Finally, in Section 9 
we discuss the results of this project and present the con- 
clusions we draw from them in Section fO. 



2 DATA SAMPLES 

2.1 The QDOT IRAS galaxy redshift survey 

We consider warm and cool subsamples drawn from the 
QDOT IRAS galaxy redshift survey (Lawrence et al., 
in preparation). This comprises redshifts for a one-in-six 
sparse sample of galaxies drawn from the QIGC, limited to 
\b\ > 10° and above a 60/tm flux limit of 0.6 Jy, which marks 



the completeness limit of the IRAS Point Source Catalog 
(Chester, Beichman & Conrow 1987). We use the 1993 revi- 
sion of the QDOT catalogue, in which the redshifts for ~200 
southern galaxies afflicted by a wavelength calibration error 
in earlier versions of the catalogue have been corrected. 

2.2 The Mask 

The standard QIGC/QDOT mask excludes from analysis 
that portion of the sky (about 4 per cent) missed by IRAS, 
plus the region \b\ < 10° and various other 'lune bins' (1° xl° 
regions defined in ecliptic coordinates) where high source 
density confuses discrimination between Galactic and ex- 
tragalactic sources. Our purposes, however, necessitate the 
exclusion of further lune bins, as we require accurate tem- 
perature determination on the basis of 60 and 100/tm fluxes. 

Not all QDOT galaxies have confirmed detections at 
100/tm and the variation of the 100/tm Galactic cirrus emis- 
sion (e.g. Low et al. 1984) across the sky results in a cor- 
responding variation in the upper limits assigned to sources 
in the absence of confirmed 100/tm detections. This situa- 
tion may be greatly ameliorated by the exclusion of lune 
bins with low 100/tm upper limits and we implement that 
here, extending our masked region to exclude from our anal- 
ysis those lune bins where the 100/tm background flux due 
to Galactic cirrus emission exceeds 15 MJy per steradian: 
with our mask thus extended, our QDOT sample covers 8.84 
steradian (i.e. about 70 per cent of the sky) and comprises 
2053 galaxies. A number of galaxies with 100/tm upper lim- 
its remain even after the mask is extended and we discuss 
what should be done with them in the next subsection. 

2.3 Temperature assignment and subsample 
selection 

Our assignment of dust emission temperatures to galax- 
ies follows the method described by S90. We assume that 
the far-infrared spectrum of each galaxy is well described 
by a single-temperature Planck function multiplied by a 
emissivity, as often considered appropriate for dust: the 
modelling by RRC89 shows this to be a reasonable approxi- 
mation for both 'disc'- and 'starburst'-dominated emission. 
For each source we fit a single-temperature 5^ oc pBv{Tohs) 
curve [where Sv is the flux at frequency p and Bv{T) is the 
Planck function for temperature T] to the spectrum, such 
that its convolution with the response curves for the 60 and 
100 /tm IRAS detectors (Beichman et al. 1988) give the ob- 
served fluxes in those passbands. The emission temperature, 
Tem, is then given by Tem = Tobs(l + z) for a galaxy with 
redshift z. 

We follow S90 in adopting a temperature of 36 K to 
mark the division between the warm and cool samples. This 
is motivated by the RRC89 models, but is also supported 
by the work of Bothun et al. (1989), who show that the 
mean temperatures of their 'normal' and 'active' ('disc' and 
'starburst/Seyfert', respectively, in the parlance of RRC89) 
samples of UGC galaxies observed by IRAS are 35 and 38 K, 
respectively: it must be emphasised, however, that the in- 
trinsic dispersion in galactic properties means that this tem- 
perature cannot be regarded as a sharp cut-off, nor should 
the warm and cool subsamples it defines be thought of as 
comprising totally distinct physical species of galaxy. 
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Table 1. Properties of principal galaxy samples 
sample no. of galaxies mean temp. (K) 
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Figure 2. The variation in the median Sqq / Siqq ratios in radial 
velocity shells. The solid line shows the power law fit used in the 
construction of the warmer and cooler matched n[z) samples. 



There remain some 275 galaxies within the unmasked 
region of the sky with 100/tm upper limits. If the 100/tm 
upper limit for a particular galaxy is sufficiently low com- 
pared to its observed 60/tm flux, then we may safely classify 
it as warm, according to our temperature criterion, but 161 
galaxies remain with uncertain temperature classifications 
in the unmasked region of our QDOT sample after account- 
ing for this. For these galaxies we take the 100/tm upper 
limit as a detection. This is not so questionable an action 
as at first it might seem, as the great majority (141 out of 
161) of these sources were detected at 100/tm in at least one 
IRAS scan and the upper limit quoted is the flux measured 
in this unconfirmed detection. We may be sure that the 94 
galaxies in our QDOT sample classified as warm by this pro- 
cedure must truly belong to that category, as their 100/tm 
fluxes can only be lower and, hence, their temperatures can 
only be higher than we have imagined, leaving us with a to- 
tal of 57 galaxies with doubtful cool classifications (since 10 
galaxies not detected at 100/tm were classified as cool on the 
basis of their upper limits in that passband). Some of these, 
no doubt, belong in the warm sample, but we shall assume 
that this has no bearing on our clustering analysis. We may 
test the validity of this assumption by analysing samples se- 
lected above a higher 60/tm threshold, which will have fewer 
uncertain cool classifications: only 13 problem cool sources 
remain in our QDOT sample when the 60/tm flux threshold 
is increased to 0.7 Jy, for example. 

The radial temperature gradient in QDOT means that 
the warm and cool subsamples created this way have very 
different redshift distributions, making it difficult to distin- 
guish temperature-dependent effects from those which de- 
pend directly on redshift. To circumvent this problem, we 
have constructed a further pair of subsamples in such a way 



warm, temp-selected 1233 

cool, temp-selected 820 

warmer, matched n[z) 1002 

cooler, matched n[z) 1010 

warm, temp, Sqq > 0.7 Jy 997 

cool, temp, Sqo > 0.7 Jy 635 



41.84 
32.04 
41.95 
34.47 
42.19 
32.56 



as to survey the same volume with each. We have done this 
by splitting QDOT into equal sized warmer and cooler sub- 
samples according to the median temperature as a function 
of distance: 

(i) We split the sample into recessional velocity shells of 
width 0.1 dex. 

(ii) We find the median Seo / Sioo ratio for galaxies within 
each shell, where Seo and 5ioo are, respectively, the IRAS 
fluxes in the 60/tm and 100/tm passbands. 

(iii) We fit this median ratio as a power law with dis- 
tance, with the result (Seo/Sioo) = 0.14:2v°'^'^'' , where v is 
the recessional velocity in km s~^. The power law fit for 
temperature versus distance is shown in Fig. 2 and appears 
to be an adequate model, in that the between numbers of 
warmer and cooler galaxies is 21 for 25 degrees of freedom, 
assuming Poisson statistics. 

(iv) We find whether each galaxy is warmer or cooler than 
the median ratio for its distance and put it into the warmer 
or cooler subsample accordingly: these samples have, by con- 
struction, almost identical redshift distributions. 

In Table 1 we list the number of galaxies in each of our 
principal IRAS galaxy samples, together with their mean 
temperatures. In Fig. 3 we plot the temperature distribu- 
tions of the two pairs of samples selected above a 60/tm 
flux limit of 0.6 Jy: note the different temperature scale 
in Fig. 3(c). This figure shows the large degree of overlap 
between the warm (cool) temperature-selected sample and 
the warmer (cooler) matched n{z) sample: the fact that the 
temperature distributions of the corresponding members of 
these pairs of samples are so similar, coupled with the neg- 
ligible difference between the redshift distributions of the 
two matched n{z) samples, means that we can combine re- 
sults from these two pairs of samples to distinguish between 
temperature-dependent effects and sampling or luminosity- 
dependent effects that depend more directly on redshift. 



2.4 Selection functions 

The radial decline in the number density of galaxies in a 
flux-limited sample is quantified by its selection function, 
i/i(r), which we define to be the expected number density of 
galaxies at distance r lying above the sample's flux limit in 
the absence of clustering. 

Throughout this paper we correct radial velocities to 
the centroid of the Local Group by adding to the observed 
heliocentric velocity of a galaxy with Galactic longitude I 
and latitude b the correction term 300 sin(l)cos(6) km s~^ 
and assume an Einstein - de Sitter universe (Qo = 1, A = 0). 

Details of the methods used in the computation of the 



The clustering of warm and cool IRAS galaxies 5 



warm temp — selected. sam.ple 



cool tem.p — selected sam.ple 



XI 

la 



(a) 



<T>=41.a4 



20 



40 



60 



80 



tem.peratu.re (K) 




20 40 60 80 

temperature (K) 



warm matctied n(z) sample 



CN3 



LO 



X! 
CO 

la CD 



o 



10 



(c) 



<T>=41.95 



20 40 60 80 

temperature (K) 



cool matched n(z) sample 



CD 
CD 



X 

*^ < — 1 



o 



<T> = 34.47 



20 40 60 

temperature (K) 



BO 



Figure 3. The temperature distributions and mean temperatures for the four QDOT subsamples selected above a 60/im flux hmit of 
0.6 Jy: (a) warm temperature-selected sample; (b) cool temperature-selected sample; (c) warmer matched n[z) sample; and (d) cooler 
matched n[z) sample. Note the different scale in (c). 



selection functions for our sample will be presented else- 
where (Saunders, in preparation) and we provide only a 
summary here. We use both parametric and non-parametric 
forms for the selection function, which we determine by the 
following procedure: 

(i) The galaxies in a given sample are binned, to gener- 
ate a redshift distribution, ra(j'j), which gives the number 
of galaxies in the bin centred on distance , measured in 
h-^Mpc. 

(ii) A naive estimate of the selection function, i/ii(rj), is 
then obtained by dividing ra(j'j) by the volume of the bin. 



(iii) To account for the clustering in the galaxy distri- 
bution, we divide i/ii(rj) by p{tj), which is the maximum- 
likelihood radial density estimator described by S90. 



(iv) The quantity so obtained [call it (t>2{'rj)] is, however, 
still not an adequate estimate of the selection function, since 
dividing through by p{rj) will remove the effect of num- 
ber density evolution, which S90 showed to be important 
for QDOT. The final estimator for the selection function is 
obtained, therefore, by multiplying i/i2(rj) by the evolution 
factor, p{tj): note that this correction divides out any depen- 
dence of the selection function on assumed K-corrections or 
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Table 2. Selection function parameters for the samples listed in 
Table 1 



sample 


10* (/), 


a 


P 


7 


logio 


QDOT 


22.0 


1.61 


3.90 


1.64 


1.81 


warm, temp-selected 


1.80 


1.91 


4.11 


1.99 


2.09 


cool, temp-selected 


10.4 


1.87 


4.61 


2.42 


1.77 


matched n[z) samples 


11.0 


1.61 


3.90 


1.64 


1.81 


warm, Sqq > 0.7 Jy 


1.88 


1.91 


4.07 


1.94 


2.06 


cool, Sqo > 0.7 Jy 


9.37 


2.05 


4.63 


2.52 


1.74 



cosmological model. Combining these factors gives our final 
non-parametric estimate of the selection function as 



V{ri)p{r,)' 

and this is then normalised as discussed below. 



(1) 



It is found that the selection function so computed 
closely approximates a double power law (Fig. 4), so a con- 
venient parametric form for the selection function is 



0(A) 



10(1 



(1 -I- l0-f^)l3f-f ' 



(2) 



where A = logjo(''/''*), for a galaxy at a distance, r, mea- 
sured in units of Mpc, and i/i,, a, r,, 7 and /? are param- 
eters describing, respectively, the amplitude, nearby slope, 
break position and breadth, and distant slope of the selec- 
tion function. 

The parameters a, r,, 7 and /? are determined by max- 
imising the likelihood quantity 



const X 



n 



Iswith rj. <r„ 



p{rk)/(t>{rk) 



(3) 



where Tmax,i is distance at which each galaxy, of given ob- 
served luminosity, would drop below the flux limit, and p{rk) 
reflects the level of evolution found as above. 

The amplitude, 1/1,, is determined by demanding that 
the Js-weighted integral over the selection function equal 
the Js-weighted sum over the sample: 



dV 



dr (l+4:7rJ3(t>(r)) 



dr 



E 

sources 



1 



(l + 4:7rJ3(t>(rj)) 



(4) 



where is the distance to the j-th source; this sum is easily 
solved by iteration. 

In Table 2 we give the values for these quantities appro- 
priate to our principal samples and in Fig. 4 we show the 
selection functions derived by these methods for the warm 
and cool subsamples derived from QDOT with a 60 fiia cut 
at 0.6 Jy. 



3 REDSHIFT-SPACE AUTOCORRELATION 
FUNCTIONS 

3.1 Method 

We follow Hamilton (1993) in using the estimator 
DaDa(s) ■ RaRa(s) 



1 + {aa(s) 



[DARA{s}f 



(5) 
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Figure 4. Selection functions for subsamples drawn from QDOT 
with a 0.6 Jy flux limit at 60 /im: (a) warm and (b) cool. The solid 
line shows the parametric fit, while the solid and empty circles 
show, respectively, the non-parametric selection function, (l>{r), 
and the naive estimate, (f)i[r). 
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Figure 5. The redshift-space autocorrelation function for our 
QDOT sample. The solid line shows the best fit power law 
(so = 3.86 ± 0.33/1-1 ]y[pp^ ^ _ 1.23 ± 0.08) over the range from 
which it was determined 



Figure 6. The redshift-space autocorrelation functions for the 
warm (stars) and cool (circles) temperature-selected samples. 



to calculate the redshift-space autocorrelation function, 
{aa(s), of galaxies of species A, where DaDa{s) is the 
weighted pair count of A galaxies whose separation places 
them in a bin centred on s, RaRa{s) is the corresponding 
weighted pair count for a random catalogue with the same 
sky coverage and selection function as the galaxy sample and 
DaRa{s) is the weighted count of cross-pairs between the 
galaxy and random catalogues. Hamilton (1993) has shown 
that this estimator is much less sensitive to uncertainty in 
the assumed mean number density of galaxies than the stan- 
dard estimator 

DaDa{s) tor^ 
riA 



1 + {aa(s) 



(6) 



DaRa{s) 

where wajWRa ^rs, respectively, the estimated mean num- 
ber densities of the galaxy and random catalogues. Love- 
day et al. (1995) found that the correlation function of the 
Stromlo-APM survey computed using equation (5) is in good 
agreement with that for a volume-limited sample of the sur- 
vey, while equation (6) yields spurious power on large scales, 
which they explain as resulting from slight differences be- 
tween the radial density distributions of the galaxy and ran- 
dom catalogues. We also find that the standard estimator 
produces more power on large scales, and, sharing the in- 
terpretation of this advanced by Loveday et al. (1995), we 
favour the use of Hamilton's estimator. 

To optimise signal-to-noise, it is necessary to weight the 
galaxies in some way that accounts for the variation in the 
mean galaxy number density with redshift. The simplest 
method (Davis & Peebles 1983) is to weight each galaxy 
by the reciprocal of the selection function at its redshift, 
so that a given volume of space is weighted by the number 
of galaxies expected to occupy it in a volume-limited sam- 
ple. This prescription gives high weights to the most distant 
galaxies in a flux-limited sample, so a distance limit must 
be imposed if the resultant weighted data-data pair count 
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Figure 7. The redshift-space autocorrelation functions for the 
warmer (stars) and cooler (circles) matched n[z) samples. 



is not to be blighted by shot noise. An alternative scheme 
(Efstathiou 1988, Loveday et al. 1992, Saunders, Rowan- 
Robinson & Lawrence 1992), and one which avoids the loss 
of information from galaxies beyond the distance limit, is 
to assign, when computing the pair count of galaxies with 
separation s, the weight 



Mr) = l/[l + 4:7rf(t>(r)J3is)], 



(7) 



to a galaxy at distance r, where (l>{r) is the selection function 
at that distance, / is the sparse-sampling factor (i.e. / = 1 
for a fully-sampled survey and / = 1/6 for QDOT) and 
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Figure 8. Counts-in-cells results for our full QDOT sample, to a depth of 300 h ^ Mpc. In (a) — (e) we show results for (/) in different 
radial shells for cells of size: (a) 1000 km s'^ ; (b) 2000 km s'^ ; (c) 3000 km s'^ ; (d) 4000 km s'^ ; and (e) 6000 km s'^ . Error bars are 
as estimated in the manner described by Efstathiou et al. (1990). In each plot the solid line denotes the maximum likelihood estimate of 
(T^(/) and the dashed line that from Efstathiou et al. (1990). In (f) we plot the maximum likelihood estimate of (t^(/) as a function of /. 
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Ms) 



^{x)x dx. 



(8) 



It can be shown (e.g. Loveday et al. 1995) that this min- 
imises the variance in the correlation function on large scales 
where the 'cluster model' of Peebles (1980) provides a good 
estimate of the uncertainty in the correlation function. The 
utility of this weighting scheme is clear: for small separa- 
tions, where 47r/i/iJ3 ^ 1 and where the contributions to 
the pair count variance are dominated by clustering rather 
than the discreteness of galaxies, the galaxies are weighted 
by the reciprocal of the selection function and, hence, equal 
volumes are weighted equally; while, on large scales, galaxies 
are given equal weight, thus reducing the problem of shot 
noise from a few, highly-weighted, distant galaxies. 

For our model Jsi-s), we used the integral over 
the real space correlation function of QDOT galax- 
ies determined by Saunders et al. (1992), multiplied 
by the linear theory redshift-space correction factor, 
F = [1 + (2Q!5 V36) + (fij Vsfc^)], derived by Kaiser (1987): 
Saunders et al. (1992) suggest that a value of _F = 1.57±0.32 
is appropriate for QDOT, although, in fact, the correlation 
function results are quite insensitive to the exact J3 model 
used. We used 20,000 particle random catalogues through- 
out: tests showed that this was sufficient, as negligible dif- 
ference was found when the number of random particles was 
increased to 100,000. We compute error bars from the scat- 
ter between correlation functions measured in octants. 




Figure 9. Counts-in-cells results for warm and cool QDOT sam- 
ples, limited to a depth of 300 Mpc: the dashed line denotes 
the variation of (t^(/) as a function of / for the cool subsample 
and the solid line that for the warm subsample. The data points 
for the cool sample are slightly displaced horizontally for the sake 
of clarity and error bars are estimated in the manner described 
by Efstathiou et al. (1990). 



3.2 Results 

We first computed the autocorrelation function of 
our QDOT sample, which is shown in Fig. 5. The 
best fit power law of the form {(s) = (so/s)"' 
over the range 1 < s < 20 /t"^ Mpc was found to be 
So = 3.86 ± 0.33 Mpc, 7 = 1.23 ± 0.08, which is in good 
agreement with the results of Moore et al. (1994), who found 
So = 3.87 ± 0.32 h-^ Mpc, 7 = 1.11 ± 0.09 for their QDOT 
sample, which has a slightly different mask. In Fig. 6 we 
plot the autocorrelation functions of the warm and cool 
temperature-selected samples: very similar results were ob- 
tained from warm and cool samples selected above a 60/tm 
flux limit of 0.7 Jy and when, instead, we used the non- 
parametric selection function, justifying our procedure for 
treating the galaxies with 100/tm upper limits and our choice 
of parametric form for the selection function. From Fig. 6, 
we see that the warm sample is clearly more strongly clus- 
tered on large scales: there is a 4(7 difference in the corre- 
lation functions over the range 17 <s <50h-^ Mpc. This 
difference vanishes, however, when we come to the autocor- 
relation functions of the warmer and cooler samples with 
matched redshift distributions (Fig. 7), which do not differ 
by as much as 2a in the full range 1 < s < 100 Mpc: this 
strongly suggests that the principal cause of the difference 
in clustering strength seen in Fig. 6 is a sampling effect, re- 
sulting from the different redshift distributions of the warm 
and cool samples, rather than a temperature-dependent one. 



4 CELL COUNT VARIANCES 

An alternative statistic for quantifying the clustering of a 
sample of galaxies is the cell count variance (Peebles 1980; 



Efstathiou et al. 1990; Loveday et al. 1992). The principal 
advantage that counts-in-cells analysis has over analyses us- 
ing correlation functions is that, in principle, by considering 
the variance between cells at the same redshift, no account 
need be taken of the decline in the mean density with red- 
shift in a flux-limited sample. The counts-in-cells analysis 
of this subsection is, therefore, intended to complement the 
correlation function analysis discussed above and, in partic- 
ular, to ensure that the results obtained above reflect the 
real clustering of the IRAS galaxy samples and are not arte- 
facts produced by their differing selection functions. The 
method of counts-in-cells analysis is described by Efstathiou 
et al. (1990) and we refer the reader to that paper for a full 
discussion of it. Oliver et al. (1995) have recently pointed 
out that the method of Efstathiou et al. (1990) produces 
biased estimates of cell count variances, as a result of its 
assumption that the galaxy selection function is constant 
throughout a particular cell, whereas, in fact, the decline in 
the selection function means that galaxies are more likely 
to be located towards the inner edge of any cell. This bias 
can be problematic when observational cell count variances 
are being compared with theoretical predictions, but it is 
of little concern to us here, as the differential bias caused 
by the difference in the selection functions of our warm and 
cool samples is likely to be much smaller than the uncer- 
tainties in our estimations of the cell count variances, given 
the relatively small size of our galaxy samples: we have not, 
therefore, included the volume-limiting correction of Oliver 
et al. (1995) in the counts-in-cells analysis whose results are 
presented here. 

The first results of our counts-in-cells analysis are shown 
in Fig. 8, where we present results for our full QDOT sam- 
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Figure 10. The projected cross-correlations of warm (stars) and 
cool (circles) temperature-selected samples of QDOT galaxies 
with the QIGC. 

pie. This figure compares our results with those obtained 
by Efstathiou et al. (1990) and shows that the presence of 
erroneous redshifts in the earlier version of the QDOT cat- 
alogue used in that paper had no bearing on the counts-in- 
cells analysis performed there. This figure also shows that 
the values of cr^ (l) for a given I vary greatly between radial 
shells and each has a large uncertainty associated with it. 
The situation is even worse once we come to split QDOT 
into warm and cool samples or reduce the galaxy numbers 
by considering higher 60/tm flux limits. This leaves us very 
reliant on the ability of the maximum likelihood mechanism 
to extract the correct solution for cr^ (l) as a function of I. We 
may try to facilitate this by excluding those shells where the 
counts are low, but we must inevitably contend with large 
uncertainties resulting from our small sample sizes. 

In Fig. 9 we present results for (T^(I) as a function of I 
for the warm and cool temperature-selected subsamples lim- 
ited to a depth of 300 Mpc. This figure shows that the 
(7^(1) — I relations for warm and cool subsamples of QDOT 
galaxies are consistent within their errors. Very similar re- 
sults are obtained using QDOT galaxies out to different 
depths, selected above different 60/tm flux limits and for 
the warmer and cooler samples with matched redshift dis- 
tributions. 



5 PROJECTED CROSS-CORRELATIONS 
WITH QIGC 

The redshift-space tests carried out above suffer from poor 
signal-to-noise. In order to improve signal-to-noise, we have 
used the cross-correlation technique of Saunders et al. (1992) 
between warm and cool temperature-selected redshift sub- 
samples and the whole 2-D QIGC parent catalogue from 
which they were drawn. This measures H((7), the integral of 
{(r) along a ray with projected separation a: we refer the 
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Figure 11. The projected cross-correlations of warmer (stars) 
and cooler (circles) matched samples of QDOT galaxies with the 
QIGC. 



reader to Saunders et al. (1992) for a full discussion of the 
method employed here to estimate H((7). The results shown 
in Fig. 10 show clear evidence for enhanced large scale clus- 
tering for the warm subsample as compared with the cool 
sample: as discussed by Saunders et al. (1992), the absolute 
level of H((7) on large scales may not be accurately estimated 
by this method, due to the breakdown of several assump- 
tions made in its derivation (see Saunders et al. 1992 for 
more details), but the difference between the values for the 
different subsamples should just result from shot noise if the 
subsamples were drawn from the same population. 

To assess whether this difference might just result from 
the different volumes surveyed by the temperature-selected 
samples, we then repeat the cross-correlation analysis with 
each of the matched subsamples against QIGC. The result 
is Fig. 11 and, although much weaker, it appears that the 
evidence for stronger large-scale clustering of the warmer 
galaxies remains: since the n{z) distributions are now es- 
sentially identical, the difference cannot be due to volume 
effects. We have, therefore, a marginal detection, at the few 
sigma level, of a real difference in the large-scale clustering 
of warm and cool IRAS galaxies. 



6 DIFFERENTIAL DENSITY MAPS OF IRAS 
GALAXIES 

In this Section we test the null hypothesis that the density 
pattern mapped by each of the warm and cool galaxy sub- 
samples is the same: i.e. that they are random samples of 
the same underlying density field. Variations between the 
two fields should be accounted for by Poisson statistics. 

To test this, we construct the reduced statistic for 
two populations of density p and p' , with shot noise variances 
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Figure 12. The density difference for tlie warm and cooi 
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Figure 13. The redshift-space correlation function, of 
warm temperature-selected sample of QDOT galaxies. The con- 
tour levels are arithmetically spaced with a separation of 0.1: the 
dashed line is the contour for ^ = 0.0, while the solid line marks 
that for C = 1-0. 



Figure 14. The redshift-space correlation function, ^('^j^)? of 
cool temperature-selected sample of QDOT galaxies. The con- 
tour levels are arithmetically spaced with a separation of 0.1: the 
dashed line is the contour for ^ = 0.0, while the solid line marks 
that for C = 1-0. 



n ^ 



(9) 



where the density field, p is calculated on a grid by 



where the sum is over all galaxies in the sub- 



sample in the i"^ cell, and each galaxy is given the unbiased 
weighting Wj = The mean density is normalised to 

unity. Each cell contributing to the reduced is weighted 
by the shot noise variance of the cell. The summation in 
equation (9) is taken over the n non-empty cells containing 
both warm and cool IRAS galaxies. 

To speed up calculation, we estimated the shot noise 
variance for each cell at a distance from the origin by 
integrating the appropriate weight over a Gaussian cell; 



X^=l + (V2^i?)" 



d^rw{r)exp{-{r -r, f/2R'^). (10) 



Here R is chosen to match the size of a cubic cell. Assuming 
that the weighting scheme can be approximated as a power 
law across the cell, we find that 



A, 



1+ 



2/2 



/27r 



dx x''' 



2/2 



,(11) 



where u = r,/R, and a = din w{r)/dlnr. This last integral 
is related to the Parabolic Cylinder function, Da{u), but 
in practise it is easier to calculate numerically. In principle, 
this variance should be equal to l/(ra)ceii, where (ra)ceii is 
the expected galaxy occupancy for the i"^ cell. However, this 
can be poorly determined due to the finite sampling of the 
cell, so instead we use the expected value for a homogeneous 
background. 

If we assume that the difference has only Poissonian 
uncertainty, the variance on each estimate of x^ can easily 
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be evaluated. For samples with the same underlying mean 
density, we calculate that 



2 

a^2 



4E 

8 = 1 



(a» + a:-2)2 



+ 3 



(12) 



To proceed, we split the IRAS galaxies into warm and 
cool subsamples in such a way that both samples had the 
same selection function (see Section 3). This has the advan- 
tage of maximum possible overlap between the catalogues, 
and avoids the spurious effects of an underlying differential 
density gradient. 

The density field for each sample was calculated over 
three sizes of cube, i?box = 4, 10 and 50 x lO'^kms"^. 
Each cube was divided up into cells of size rceii = -Rbox/5, 
-Rbox/10, i?box/15, and i?box/30. This resulted in some over- 
lap between the sizes of cells used to calculate the density, 
allowing us to check for consistency between differently sam- 
pled volumes. The choice of three sizes of cube allowed us 
to measure the density fields over a wide range of scales, 
without wasting time over sparsely sampled cells. 

Fig. 12 shows the results of applying this test to 
the matched n{z) QDOT samples. On scales larger than 
10/t"^Mpc, we see that the two density fields sample the 
same pattern in a manner consistent with the Poisson sam- 
pling hypothesis. Only five cells contribute to the value of 
for the smallest cell size (rceii = 1.3/t"^ Mpc), so the roisson 
error bar assigned to that data point will be a serious un- 
derestimate of the true uncertainty, and so the highly signif- 
icant difference between the redshift-space distributions of 
the warm and cool samples it purports to show should be re- 
garded as spurious. The assumed Gaussian statistics should 
hold for the larger cell sizes, for which there are many more 
usable cells, so the significant deviation of the reduced 
statistic from unity for 2 < rceii/h~^ Mpc < 10, should be 
taken as direct evidence for a difference between the redshift- 
space clustering patterns of warm and cool IRAS galaxies: 
the origin of this difference is a tendency for smaller cells 
(''cell < 10/t~^Mpc) to have larger numbers of warm galax- 
ies than cool ones. 



7 STRUCTURE IN THE REDSHIFT-SPACE 
CORRELATION FUNCTION 

If, as our model and the results of the previous Section sug- 
gest, warm IRAS galaxies are preferentially located in richer 
environments than cool ones, we might expect to see a re- 
sultant difference in the structure in the redshift-space cor- 
relation functions, {(cr, tt), of the two samples: we look for 
such a difference in this Section. 

We compute {(cr, tt) using the estimator 



1 + i{a,7r) 



DaDa{t, tt) ■ RaRa{t, tt) 
[DaRa{(t, 7r)]2 



(13) 



with, as before, 20,000-particle random catalogues and a 
minimum variance weighting scheme, employing an ap- 
propriate model J3(s). We use separation bins of width 
1.0 Mpc in both directions, over the range to 
30/t~"'Mpc. The raw {(cr, tt) data are very noisy, so we have 
smoothed them twice with a Shectman (1985) filter (a 1-2-1 
boxcar in two dimensions) to produce the plots shown here. 
In Figs. 13 and 14 we plot the smoothed {(cr, tt) results 
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Figure 15. The redshift-space correlation function, ^((T, tt), of 
warmer matched n[z) sample of QDOT galaxies. The contour lev- 
els are arithmetically spaced with a separation of 0.1: the dashed 
line is the contour for ^ = 0.0, while the solid line marks that for 

C = 1.0. 



for the warm and cool temperature-selected samples. It is 
interesting to note that the structure in {(cr, tt) is far more 
pronounced in the plot for the warm sample than that for 
the cool sample: the same structures can often be seen in 
both plots, but with a higher amplitude in the warm plot, 
while in other cases, a strongly correlated region in the plot 
for the warm sample corresponds to an anti-correlated region 
in that for the cool sample. We show the corresponding plots 
for the warmer and cooler matched n{z) samples in Figs. 15 
and 16, where, again, the plot for the warmer sample exhibits 
more structure, albeit in a much less pronounced way than in 
Figs. 13 and 14. Figs. 13 - 16 together give some qualitative 
support to our model in which more warm than cool IRAS 
galaxies are found in richer environments, although, clearly, 
the increased concentration into richer environments is more 
a function of redshift than of temperature. 



8 CROSS-CORRELATIONS WITH ABELL 
CLUSTERS 

A direct way to probe whether warm IRAS galaxies are 
preferentially located in richer environments than cool ones 
would be to study the cross-correlations between the warm 
and cool samples and a suitable cluster catalogue. Unfortu- 
nately, if we were to restrict our warm and cool QDOT sam- 
ples to the areas of the objective cluster catalogues drawn 
from the APM (Dalton et al. 1992) and EDSGC (Lums- 
den et al. 1992) galaxy catalogues we would have too few 
objects left to conduct the desired analysis. Since many 
Abell clusters (particularly those in the R=0 class) are sus- 
pected of being chance alignments of galaxies along the line 
of sight, rather than true rich clusters (e.g. Lumsden et al. 
1992), our aim in calculating cross-correlations with Abell 
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Figure 16. The redshift-space correlation function, ^((T, tt), of 
cooler matched ri(^) sample of QDOT galaxies. The contour levels 
are arithmetically spaced with a separation of 0.1 : the dashed line 
is the contour for ^ = 0.0, while the solid line marks that for 

C = 1.0. 



Figure 18. The cross-correlations of warm (stars) and cool (cir- 
cles) temperature-selected samples of QDOT galaxies with Abell 
clusters: (a) the full > sample (filled symbols) and (b) the 
R y 1 subsample (open symbols). 
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Figure 17. The cross-correlations of QDOT galaxies with Abell 
clusters: (a) the full i? > sample (filled symbols) and (b) the 
i? > 1 subsample (open symbols). 



Figure 19. The cross-correlations of warmer (stars) and cooler 
(circles) matched n[z) samples of QDOT galaxies with Abell clus- 
ters: (a) the full i? > sample (filled symbols) and (b) the i? > 1 
subsample (open symbols). 



clusters in this Section is more modest: we simply wish to 
see whether the increasing strength of cross-correlations be- 
tween Abell clusters and QDOT galaxies that Mo, Peacock 
& Xia (1993) see as they exclude QDOT galaxies with red- 
shifts first z < 0.02 and then z < 0.03 can be accounted for 
by our temperature-dependent clustering model or whether 
it is a sampling effect. 

Our Abell cluster samples are drawn from those of 



Peacock & West (1992), with some updated redshift in- 
formation (M. West, private communication). The Pea- 
cock & West (1992) sample covers the region of the sky 
at Galactic latitudes \b\ > 25°, with the additional ex- 
clusion of the regions RA=3'^ — 6*^, Dec.=0° — 35° and 
RA=15'' - 18'', Dec. = -30° - 0°, and covers the redshift 
range 0.01 < z < 0.08. If we restrict this sample to the area 
not excluded by our IRAS mask, we obtain a volume-limited 
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Abell cluster sample covering 6.5 steradian, containing 220 
R = and 205 R > 1 clusters, which is about 90% complete 
in spectroscopic redshifts. The IRAS samples are reduced, 
too, by their restriction to the region allowed by Peacock 
& West (1992): the warm and cool temperature-selected 
samples fall to 691 and 409 galaxies, respectively, while the 
warmer and cooler matched samples now contain, respec- 
tively, 550 and 549 galaxies. 

We compute the cross-correlation functions using the 
estimator 



1 + {ab(s) 



DaDb{s} RaRb{s} 



DaRb{s) RaDb{s) 



(14) 



where the meaning of the terms is clear from the discussion 
following equation (5). This is clearly a generalisation of the 
autocorrelation function estimator (equation 5) devised by 
Hamilton (1993) and, as we show in the Appendix, it shares 
with it the virtue of having an uncertainty which is second 
order in the uncertainties of the mean number densities of 
species A and B, as well as possessing the desirable quality 
of symmetry under the exchange of labels A and B. For these 
reasons we favour its use over that of the standard estimator 

DaDb{s) ur^ 
DaRb{s) ua 



1 + {ab(s) 



(15) 



which has an uncertainty that is first order in the uncer- 
tainty in the mean number density of B galaxies, as well 
as being asymmetric under exchange of A and B, which 
is aesthetically unpleasing in an estimator of an essentially 
symmetric quantity such as a cross-correlation function. As 
before, we use 20,000-particle random catalogues, employ 
the minimum variance weighting scheme with an appropri- 
ate Jsis) and compute error bars from the scatter between 
cross-correlation functions measured in octants. 

In Fig. 17 we present the cross-correlation function 
for our full QDOT samples and our Abell cluster sam- 
ples, both the full R > sample and its i? > 1 sub- 
sample. The best-fit power law for the cross-correlations 
with the R > 1 case over the range 3 < s < 30 Mpc 
is So = 10.01 ± 0.80 Mpc, 7 = 2.04 ± 0.23, which is in 
good agreement with the results of Moore et al. (1994), who 
found So = 10.10 ± 0.45 h'^ Mpc, 7 = 1.75±0.10 for slightly 
different galaxy and cluster samples. Figs. 18 and 19 present 
the corresponding plots for the warm and cool temperature- 
selected and warmer and cooler matched QDOT samples 
respectively. They clearly show that the clustering strength 
enhancement observed by Mo et al. (1993) must have its 
origin in a sampling effect, as the two pairs of warm and 
cool cross-correlation functions differ by less than 2a over 
the full range (1< s < 100 Mpc) studied. 



9 DISCUSSION 

In this paper we have questioned whether there might be 
a connection between the radial temperature and cluster- 
ing strength gradients observed in the QDOT IRAS galaxy 
redshift survey. We showed how simple ideas of interaction- 
induced star formation in IRAS galaxies might produce such 
a connection and we proceeded to look for evidence for it, 
through a series of statistical tests of the clustering of warm 
and cool subsamples drawn from the QDOT redshift survey. 
We obtained somewhat equivocal results. A significant 



difference was detected between the autocorrelation func- 
tions of the samples for separations 17 < s < 50 Mpc, 
but this disappeared when we considered samples with 
matched redshift distributions, which suggests that the dif- 
ference is caused by a sampling, rather than a temperature- 
dependent effect, given the similarity between the red- 
shift distributions of the corresponding members of the 
temperature-selected and matched n{z) pairs of samples. 
A counts-in-cells analysis found no difference between the 
cell count variances of warm and cool samples. In an at- 
tempt to improve upon the signal-to-noise levels obtained 
in these redshift-space clustering tests, particularly the last 
mentioned, we computed the projected cross-correlations of 
the warm and cool samples with the QIGC, from which the 
QDOT redshift survey was drawn. We found the warmer 
sample to have stronger cross-correlations on large scales, 
and this persisted, albeit only at a marginal significance, 
when we removed sampling effects by studying the matched 
n{z) samples. We directly tested the null hypothesis that the 
galaxies in the warmer and cooler matched n{z) samples are 
drawn from the same population in redshift-space and were 
able to rule it out on small scales, finding that cells of size 
< 10h~^ Mpc tended to have a significantly larger number 
of warmer galaxies than cooler ones. The structure in the 
{((7, 7r) plots for the warm and cool temperature-selected 
samples appeared to support the interpretation of these re- 
sults as indicating that warmer IRAS galaxies are preferen- 
tially located in richer environments, in accordance with our 
interaction-induced model for the origin of their 60/tm emis- 
sion, but the corresponding plots for the matched n{z) sam- 
ples revealed that the greater concentration of warm galax- 
ies in richer environments results more from the region of 
space sampled by the warm sample than from any directly 
temperature-dependent effect. We advocated the use of a 
new estimator for cross-correlation functions and applied it 
to the computation of the cross-correlations between our 
galaxy samples and a volume-limited sample of Abell clus- 
ters and this, too, suggested that the origin of the cluster- 
ing strength gradient detected in QDOT by several previous 
authors lies in a sampling effect, rather than a temperature- 
dependence to the clustering of IRAS galaxies. 

That significant sampling effects exist on the scale of 
the QDOT survey is, perhaps, surprising, especially since 
IRAS galaxies avoid rich clusters, so their clustering will 
be less sensitive to local variations in the cluster number 
density than that of optical galaxies: these effects should 
be taken into account when comparing theoretical predic- 
tions with clustering data from QDOT, especially on large 
scales. Only the higher signal-to-noise analysis possible with 
the PSC-2 IRAS redshift survey will reveal whether the 
temperature-dependent clustering component for which we 
claim a marginal detection here is of sufficient magnitude 
to warrant similar treatment. It serves, however, as an ex- 
ample of a type of effect that will becoming increasingly 
important in the study of large-scale structure in coming 
years. As the size of available redshift surveys increases by 
an order of magnitude it will become possible, and neces- 
sary, to consider in detail the clustering of subsamples of 
galaxies selected according to astrophysical criteria such as 
luminosity, morphology, spectral type, etc: the investigation 
of the coupling between astrophysical and clustering proper- 
ties will not only be desirable for the correct comparison of 
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observation and theory, but it may also shed invaluable light 
on the nature of bias, through the relationships between dif- 
ferent types of galaxy and their environment. 

The clustering analysis presented here can only indi- 
rectly test our interaction-induced star formation model for 
coupling the temperature and clustering strength gradients 
and, as we have seen, has produced only a marginal level 
of support for it. A more direct test can be made by quan- 
titatively studying the environments of a large sample of 
IRAS galaxies selected over narrow ranges of redshift and 
60/tm luminosity, but with a wide range of 60/tm to 100/tm 
flux ratios: two of the present authors (RGM and ANT) will 
soon be reporting on exactly such a study (Goldschmidt et 
al., in preparation), which should reveal whether the link 
between interactions and enhanced far-infrared luminosity 
that exists for the most luminous IRAS galaxies continues 
down the 60/tm luminosity function as far as the moderate 
luminosity galaxies that dominate redshift surveys of IRAS 
galaxies. 



10 CONCLUSIONS 

We have advanced a scenario in which the increase in cluster- 
ing strength with redshift observed in flux-limited redshift 
surveys of IRAS galaxies arises as a result of the temper- 
ature gradients in the surveys, through a model in which 
the 60/tm emission from IRAS galaxies has its origin in 
interaction-induced formation of massive stars. We have ob- 
tained some support for this scenario from clustering anal- 
yses of temperature-selected samples of QDOT galaxies, 
but we have shown that (at separations > 10h~^ Mpc at 
least) the dominant component to the increase of cluster- 
ing strength with redshift is a sampling effect, reflecting the 
local cosmography in the outer reaches the QDOT survey, 
which includes many rich clusters. This "cosmic variance" 
effect should be borne in mind by those using flux-limited 
samples of IRAS galaxies in large-scale structure studies. 
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APPENDIX A: A NEW CROSS-CORRELATION 
FUNCTION ESTIMATOR 

The standard estimator for the cross-correlation function, 
{ab(s), of galaxies of species A and B takes the form 

Da-Db(s) ra_RB 

KB 



1 + {ab(s) 



(Al) 



DaRb{s) 

where the meaning of the terms follows from the paragraph 
below equation (5), of which equation (Al) is clearly the 
analogue. This estimator, like its analogous autocorrelation 
function estimator, suffers from having an uncertainty which 
is first order in the uncertainties in the mean number densi- 
ties of the samples of A and B galaxies. It is also asymmet- 
ric under exchange of the labels A and B, which is clearly 
unattractive in an estimator for a symmetric beast like a 
cross-correlation function. These two facts combined can, in 
practice, produce the situation where equation (Al) and the 
estimator derived from it by the interchange of A and B can 
yield significantly different cross-correlations, even for the 
case of roughly equally-sized samples of A and B objects, 
where there is no a priori reason to favour one estimator 
over the other. 

This undesirable situation may be avoided through the 
use of the following estimator: 



1 + {ab(s) 



DaDb{s) RaRb{s) 



(A2) 



DaRb{s) RaDb{s)' 

If A=B this clearly reduces to Hamilton's estimator for the 
autocorrelation function and, as we shall now show, it shares 
with it the property of having an uncertainty which is second 
order in the uncertainty in the mean number densities of the 
galaxy samples, as well as being manifestly symmetric under 
the exchange of labels A and B. 

In the notation of Hamilton (1993), to which we refer 
the reader for the background to the following derivation and 
the notation in which it is written, equation (A2) becomes 



1 + {ab(s) 



{NaNb) {WaWb) 
(NaWb) ' (WaNb)' 



(A3) 



where Na denotes iV^g galaxies of species A weighted in 
some fashion and Wa is the "catalogue window", which is 
the selection function i/i* of A galaxies weighted in the same 
way as the A galaxies themselves: the angular brackets de- 
note averages taken over all points in the catalogues with 
separations that place them in the bin centred on s. 

Following Hamilton (1993), we define a pair window, 
, and a point window, Wi , as follows: 

wAB _ AB ,A ,B / \ A \ 

Wi2 = Wi2 <t>l <t>2 (A4) 

and 

Wt = wt(t>t, (A5) 

where and Wi are, respectively, pair and point weights: 
note that the pair weighting need not be separable and so 
need not equal the product of the point weights Wi 
and W2 ■ If the is the true space density of A galaxies at 
some point labelled 1, then, by the definition of the selection 
function, iV^g = <j)^ (neglecting a constant factor that 
cancels out in all the following analysis) and the quantities 
NaNb, NaWb and WaWb become 



^Va^Vb 



AB jfA jfB 



TirAB A B 
W^2 «1 «2 : 



(A6) 



^A^B = mi' NXs.I 4>2 = Wi]^ 111 , 



(A7) 



WaWb = w'^2 4>i4>2 = ■ (A8) 

It follows, therefore, that equation (A3) may be written as 



1 + {ab(s) 



(A9) 



Now, if we define 8i to be the true overdensity in A galax- 
ies at point 1 and n to be their true mean density, then 
equation (A9) becomes 



1+{ab(s) 



(p^AB^A (1 + 8t))(Wlfn^ (1 + 8^)) 



(AlO) 



If we then follow Hamilton (1993) in defining the cross- 
correlation function between the A galaxies and the B cat- 
alogue to be 



,AB _ (W^lT^f 
Vl2 



(All) 



and the windowed cross-correlation function of A and B 
galaxies (which is the best estimate of the true cross- 
correlation function obtainable from the galaxy samples), 
to be 



(A12) 



MB _ {Wi2 ^1 ^2 ) 

- (wif) ' 

we see that the estimated cross-correlation function reduces 
to the form 

gAB(,)_^A(^)^B(^) 



?ab(s) 



(A13) 



Now, since (Hamilton 1993) 0{8^ ), where 

8^ = {W^8t}/{W^}is the mean overdensity of A galaxies 
in the catalogue, it follows that the uncertainty in {ab (s) 
is second order in the uncertainties in the mean densities, 
which is better than the standard estimator, which would 
become, in this notation. 



?ab(s) 



|ab(s)+ [^B(s)-«B] -^A(s)«^ 



which is first order in 8^ . 



(A14) 
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